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Higgs boson pair production at the LHC in the bbW + W channel 
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We consider Higgs boson pair production at the LHC in the bbW + W~ channel, with subsequent 
decay of the W + W~ pair into tvjj. Employing jet substructure and event reconstruction techniques, 
we show that strong evidence for this channel can be found at the 14 TeV LHC with 600 fb -1 of 
integrated luminosity, thus improving the current reach for the production of Higgs boson pairs. 
This measurement will allow to probe the trilinear Higgs boson coupling A. 
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Introduction. One of the aims of the LHC is to search 
for the agent of electroweak symmetry breaking (EWSB), 
which in its minimal form is the Standard Model (SM) 
Higgs boson (h). Recently, both the ATLAS and the 
CMS collaborations have observed a new state with a 
mass of about 125 GeV, whose properties are in sub- 
stantial agreement with the SM Higgs boson pQ. The 
quest for understanding the mechanism behind EWSB 
does not end with the discovery of the Higgs boson. It 
is crucial to test the Higgs boson potential to its full 
extent, measuring the couplings of the Higgs boson to 
gauge bosons and matter fields, and also to probe its 
self interactions. After EWSB, the Higgs potential can 
be written as V(h) = m 2 h h 2 /2 + Xvh 3 + A/i 4 /4. In the 
SM, A = A = (ml/2v 2 ) w 0.13 for m h =l25 GeV. With 
an extended Higgs sector, as is common in many new 
physics models beyond the SM, these couplings will de- 
viate from the SM values. Therefore, measuring these 
two couplings is very important to reveal the true nature 
of the Higgs boson. At the LHC, the quartic coupling A 
may be probed via triple Higgs boson production. How- 
ever, its tiny cross section [2] makes it very difficult, if 
not impossible, to do so. On the other hand, the trilinear 
coupling A can be measured with Higgs boson pair pro- 
duction, pp — ¥ hh, which may be discovered at a large 
luminosity phase of the LHC. In the following we will 
focus on that possibility. 

The discovery potential of Higgs boson pair production 
at the LHC has been studied in [3J 0] ■ Ref . [3] concen- 
trated on the decay channels hh — > 6677, bbp + fi~ , finding 
that with 600 fb -1 one expects 6 signal and 11 back- 
ground events, giving a significance of about 1.5a. In the 
recent years, jet substructure has been shown to be very 
important when dealing with hadronic decays of heavy 
particles [5] . In the h — ¥ bb case, this was put forward in 
the seminal paper by Butterworth, Davison, Rubin and 
Salam (BDRS) [5] in the context of Wh and Zh produc- 
tion, which were previously considered as challenging to 
probe at the LHC. With the subjet techniques, BDRS 
have shown that this can become a very promising dis- 
covery channel for the Higgs boson. Ref. [4] employed 
these new techniques, and assuming good r reconstruc- 
tion efficiency (~ 80%), the authors claimed the bbr + T~ 



channel as the most promising one, with 57 signal and 
119 background events at 600 fb -1 . 

In both [3 and [4j, the hh -> bbW+W- -> bbtvjj 
channel was considered less promising, due to the large 
ti background. In this Letter, we apply the BDRS tech- 
niques to this final state in conjunction with event recon- 
struction using mass-shell constraints, assuming that the 
Higgs boson mass is well-measured. We show that in the 
highly boosted regime, the reconstruction of both Higgs 
bosons present in the event allows us to distinguish the 
signal and background, thereby turning this channel into 
a potentially significant contribution in the discovery of 
Higgs boson pair production. 

Higgs pair production and decay. The main pro- 
duction mechanism for Higgs boson pairs at the LHC is 
gluon fusion, which was studied at leading order (LO) in 
quantum chromodynamics (QCD) in 0[H]. Other pro- 
duction modes such as qq — > qqhh, Vhh, tthh are a factor 
of 10-30 smaller [9J [TO] , and therefore we do not consider 
them in the rest of our analysis. 

We employ the code HPAIR [TT] to compute the pro- 
duction cross section, which implements the next-to- 
leading order (NLO) QCD corrections obtained in the 
heavy top quark limit |12j . We have modified the public 
version of HPAIR in order to use the up-to-date parton 
distribution functions (PDFs) present in the LHAPDF 
library [13]. For the LO and NLO cross sections, we 
employ CTEQ6L1 and CT10 [H] PDF sets with the 
corresponding values of a s , respectively. We adopt 
the pole masses for the top and bottom quarks to be 
m t = 174.0 GeV and m b = 4.5 GeV. For a 125 GeV 
Higgs boson, we have obtained an NLO cross section of 
32.3^47 fb, where the uncertainty reflects the variation 
of the renormalization and factorization scales fi r = pf 
around the central value /j,q by a factor of 2, with ^0 being 
the Higgs boson pair invariant mass. In the left panel of 
Fig. [T] we show the scale variation of the production cross 
sections at LO and NLO. One can observe that there is 
a large K-factor (~ 2) on the cross section, and that the 
scale uncertainty is still high (about 20%). Either an ex- 
act NLO calculation, the NNLO computation in the large 
top quark mass limit or performing QCD resummation 
could help reduce the scale uncertainty. 
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FIG. 1. (a) Scale variation of gluon fusion cross section for 
Higgs boson pair production, at LO and NLO. (b) cross sec- 
tions times branching ratios at the 14 TeV LHC, for Higgs 
boson pair production. We show only the dominant decay 
modes. : bbbb (dots), bbjj (short dashes), bbbjjjj (dot- 
dashes), bbr + T~ (long dashes) and bblvjj (solid). Note that 
the four main decay modes are fully hadronic. 

For the branching ratios, the values of [15] were used. 
In the mass range (120, 130) GeV, the Higgs boson decay 
modes with the largest branching fractions are h — > bb 
and h —> W + W~ . The most probable decay mode for a 
pair of Higgs bosons is hh —> bbbb. This mode is chal- 
lenging to search for, mostly due to the fact that it is 
difficult to trigger on, and that it competes against the 
QCD multi-jet backgrounds that possess overwhelmingly 
large cross sections. In general, QCD backgrounds can 
be suppressed with the existence of leptons and missing 
energy. We plot in the right panel of Fig. [T] the total 
rates for the five most important channels at the 14 TeV 
LHC, with the Higgs mass in the 120-130 GeV range. As 
can be seen, the first four channels are purely hadronic. 
The most important channel that contains leptons and 
missing energy is bbW + W~ with W + W~ — > tvjj, where 
I is either an electron or a muon and j refers to light jets. 
For a 125 GeV Higgs boson, the branching ratio for this 
mode is ~ 7.25%, and the total rate is ~ 2.34 fb. 
Event generation and analysis. We now describe our 
analysis strategy for the bbtvjj channel. We will focus 
on a 'mid-term' integrated luminosity of 600 fb^ 1 for the 
LHC at a center-of-mass energy of 14 TeV. The largest 
background for this final state is tt production with semi- 
lcptonic decay of the top pair. This background is the 
most challenging one: not only it has a large total rate 
(~ 240 pb), but also possesses a mass scale, given by 
the top mass (~ 175 GeV). The second important back- 
ground is W(—> ^<v)66+jets, with a total rate of ~ 2.17 pb. 
Other QCD multi-jets production associated with a W 
boson can enter, with two light jets misidentified as com- 
ing from 6-quarks. Backgrounds originating from asso- 
ciated production of a single Higgs boson can also be 
present: h(—> WW)bb, h(—> bb)WW and /i+jets where 
the jets are miss-identified. 

Parton-level events of the hh signal, with the Higgs bo- 
son mass set to 125 GeV at the 14 TeV LHC have been 



generated using a custom MadGraph 5 model [THl IT7] . 
which includes the full top quark mass effects in the rel- 
evant box and triangle diagrams. The factorization and 
renormalization scales are set to fj,p = fj,R = 125 GeV, 
and we checked that other scale choices do not substan- 
tially alter the conclusions of our analysis. The decays of 
the Higgs bosons are performed in HERWIG++ [THUS], an d 
the total rate is normalized to the NLO value of 2.34 fb. 
The tt background is generated using HERWIG++ with sub- 
sequent semi-leptonic decay, whose cross section is nor- 
malized to the approximate NNLO value (times branch- 
ing ratio) of 240 pb [20] • Parton-level events for other 
backgrounds are generated using ALPGEN 21J, where the 
transverse momenta of light partons or 6-quarks were 
constrained to be pr > 30 GeV and their separation sat- 
isfies AR = ^{Ay) 2 + {A4>) 2 > 0.35, with y and <j> be- 
ing the rapidity and azimuthal angle, respectively. The 
parton-level events are then showered and hadronized via 
HERWIG++. Whenever applicable, MLM-matching [21] as 
implemented in HERWIG++ [JJ5] is used to avoid double- 
counting in certain regions of phase space. 

The hadron-level particles satisfying pt > 0.1 GeV and 
\r)\ < 5 are clustered into jets with the Cambridge- Aachen 
algorithm using Fast Jet |22j . with a radius parameter 
R = 1.4. We then pick those jets with p T > 40 GeV, 
which results in what we call 'fat' jets. For a given fat jet 
j, we then examine its subjets j\ and j2 (with rrij 1 > rrij 2 ) 
following the BDRS [6] procedure. We ask for a signif- 
icant mass drop mj 1 < imij with /i = 0.667, and re- 
quire that the splitting is not too asymmetric by impos- 
ing min(p^ i ,p^ J - 2 )Ai?| i j2 /m 2 > 0.09. We also apply a 
'filtering' procedure similar to that applied by BDRS: re- 
solving the fat jets on a finer angular scale Rmt < Rji,j 2 
and taking the three hardest objects (subjets) that ap- 
pear, where we choose Ran — min(0.35, Rj lt j 2 /2). This 
provides versatility to the analysis against the effects of 
extra radiation, particularly the underlying event. In the 
present study we do not consider the effects of the detec- 
tor resolution, which of course have to be included in a 
detailed experimental study. 

We look for events containing at least two filtered fat 
jets satisfying the mass drop condition. We then impose 
the following conditions: 

1. Exactly one isolated lepton with p T ^ > 10 GeV and 

< 2.5, where isolation means that the scalar sum 
of the transverse momenta of the visible particles 
lying inside a cone of radius R = 0.15 around the 
lepton is less than 0.1 x pr.i- 

2. Missing transverse energy $t > 10 GeV. 

3. At least one fat jet with its two leading subjets b- 
tagged, which satisfies \r)\ < 2.5 and pr > 180 GeV. 
Among these we take the one with highest p T as the 
h —> bb candidate and refer to it as hi . The system 
of the two &-tagged subjets is referred to as bb. 
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Process 


^initial (fb) 


Obasic (fb) 


hh — > bbiujj 


2.34 


0.134 


ti — > bblvjj 

£is)bb+jets 


240 x 10 3 
2.17 x 10 3 


15.5 
0.97 


W{-+ &/)+jets 


2.636 x 10 6 


0(0.01) 


£fjj)+]ets 

fc(-> 

66) + WW(-+ lujj) 


36.11 

6.22 

0.0252 


0(0.0001) 
0(0.001) 



TABLE I. Cross sections for the signal and backgrounds be- 
fore (second column) and after (third column) the 'basic' cuts. 
For the irreducible backgrounds where true 6-quarks are not 
present, a miss-6-tagging probability of 1% for light jets are 
included. The MLM-matching is applied to the VK66+jets, 
IF+jets and fe+jets processes. 



4. A second fat jet with pr > 40 GeV and m > 5 GeV, 
which, together with the lepton and $t, can recon- 
struct the IU-decaying Higgs boson (/i 2 ). This jet 
will be considered as candidate for the hadronically 
decaying W boson, and will be referred to as Wh- 
in the above, 6-tagging is implemented in the event gener- 
ators by keeping the lightest B-hadrons stable. Through- 
out this work we assume a 6-tagging efficiency of 70%. 
The reconstruction of the H^-decaying Higgs boson is 
achieved by solving the set of equations m h = {pi + p„ + 
Pw h ) 2 and pi — 0, where the transverse components of 
p v are identified with those of the missing transverse mo- 
mentum. Here we assume that the mass of the Higgs 
boson will already have been measured to a reasonable 
accuracy. Note that since the equations are quadratic, 
there are two solutions for the z-component of momen- 
tum of the neutrino. It is, however, not possible to decide 
which is the correct one and we therefore do not use this 
information in our analysis. Here we reject events giving 
complex solutions, although one may adopt some imag- 
inary part 'tolerance' to accommodate the smearing of 
the momenta by detector effects [23 . 

The conditions described above will be referred to as 
the 'basic' cuts, and already provide strong rejection 
against backgrounds. Table [T] shows the starting cross 
sections for the processes considered as well as the re- 
sulting cross sections after the 'basic' cuts. Among the 
irreducible backgrounds where the final states are exactly 
the same as our signal, the important ones are ti and 
Wbb+jets, which we will further analyze, while the hbb 
and hWW processes are negligible. The VF+jets back- 
ground requires two miss-6-tagged light jets to fake our 
signal. We estimate the rejection factor as follows: for the 
W+jets inclusive sample, we pick the hardest filtered fat 
jet and, assuming that its two hardest filtered subjets are 
miss-6-tagged, we apply the 'basic' cuts to the event. We 
multiply the resultant cross section by the light jet rejec- 
tion factor (10 -4 , assuming the light jet miss-6-tag prob- 
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FIG. 2. Distributions for signal and backgrounds of (a) pr,h t 
after the basic cuts; and (b) R b % hl , (c) rrih 1 , (d) mw h after 
the basic cuts and pt,h.i > 240 GeV. 



ability to be 1%) for two jets. The /i+jets background 
also requires miss-6-tags, for which we work in the same 
way as with the W+jets. These reducible backgrounds 
are found to be irrelevant after the 'basic' cuts. 

We investigate in further detail the hh signal versus 
the ti and Wbb+jets backgrounds, going beyond the 'ba- 
sic' cuts. We show the signal (S) and background (B) 
distributions to demonstrate the set of cuts that provides 
a high significance, while retaining a reasonable number 
of signal events in order to keep the statistical error un- 
der control. We show in Fig. [2^ a) the prjn distributions, 
where we see that the signal tends to have a larger pt 
for the Higgs candidate. We therefore impose a harder 
cut ptm > 240 GeV and subsequently consider the (b) 
R b l hl (distance between the hi fat jet and the bb sub- 
system), (c) and (d) m^ h distributions. One can 
observe that significant background rejection can be ob- 
tained by selecting mw h around the W boson mass mw, 
requiring that the b and b subjets are more symmetri- 
cally distributed in the fat jet hi by choosing a small 
R b l h , and imposing a mass window for around 
the true Higgs mass rrih- We choose mw h > 65 GeV, 
m hl G [120 - 130] GeV and R bl hi < 0.06. Using these 
simple cuts, we obtain about 4.6 signal and 2.6 back- 
ground events at 600 fb -1 , thus getting S/y/S + B ~ 1.7, 
and a significance of 2.2<r. To gain more discriminating 
power, we explored in more detail the kinematic distribu- 
tions of the various objects. While a cut-based method 
is possible (we managed to achieve 2.5cr with S ps 4 and 
B w 1), we performed a more dedicated multivariate 
analysis for that purpose. To this end we employ the 
boosted decision tree (BDT) method |Mj implemented 
in the ROOT TMVA package In addition to our previ- 
ous set of variables, we add the following: pr,h 2 i PT,w h , 
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FIG. 3. Outputs of BDT analysis. Left: background re- 
jection vs. signal efficiency. Right: normalized signal and 
background distributions against BDT response. 

PTMh-2, Rh!,w h , M Tliv , A<f>e t „, A<p WuWh , where Wi refers 
to the leptonically decaying W boson, and the trans- 
verse mass of the lcpton and neutrino system is defined 
as M| A = (E T ,i + E T . M ) 2 - (pr ti + vt, v ) 2 - 

We trained 1000 decision trees, from which the outputs 
are shown in Fig. [3] where we can see that one can ob- 
tain good discrimination between signal and background. 
We find that when cutting at a value of around 0.1, we 
can obtain S/ \JS + B ~ 2.4 and a significance of 3.1er, 
with S ~ 9 and B sa 6. We have checked that the inclu- 
sion of underlying event for the signal sample does not 
bring down the significance substantially. Further im- 
provement can be obtained if one consider the tauonic 
decays of the W bosons in both signal and background. 
Assuming a t reconstruction efficiency of ~ 70%, one 
can obtain an increased significance of 3.6 (3.0) using 
the BDT (cut-based) analysis. 

Conclusions. We have studied the prospects of detect- 
ing Higgs boson pair production at the 14 TeV LHC 
in the bbtvjj channel, where t is either a muon or an 
electron. Our analysis is based on exploiting jet sub- 
structure techniques to identify the h — > bb decay for 
a Higgs boson in the boosted regime as a fat jet, and 
also event reconstruction for the h — » W + W~ decay. In 
spite of the very tiny initial signal to background ratio, 
we have identified a few useful kinematic variables that 
allow to discriminate signal from background. By cut- 
ting on these variables one can achieve an 0(1) signal to 
background ratio, although retaining only a few handful 
of events for 600/fb. Further increase in the sensitivity 
can be achieved by including several more variables into 
the analysis. Given that scenario, we turned to a mul- 
tivariate boosted decision tree analysis, which allows to 
obtain a significance of about 3tr while retaining a larger 
number (about 10) of signal events. Furthermore, the 
significance can be enhanced if we consider tau leptons 
in the final state, allowing to obtain just under 4er of 
sensitivity. This channel will make an important contri- 
bution, in combination with the already studied bbr + r~ 
and 6677, final states, towards the discovery of Higgs pair 
production at the LHC, and measuring the trilinear self 



interaction. 
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